Learning Microbial Interaction Networks from Metagenomic Count Data

Authors

  • Surojit Biswas
  • Meredith McDonald
  • Derek S. Lundberg
  • Jeffery L. Dangl
  • Vladimir Jojic
Abstract

Many microbes associate with higher eukaryotes and impact their vitality. To engineer microbiomes for host benefit, we must understand the rules of community assembly and maintenance that, in large part, demand an understanding of the direct interactions among community members. Toward this end, we have developed a Poisson-multivariate normal hierarchical model to learn direct interactions from the count-based output of standard metagenomics sequencing experiments. Our model controls for confounding predictors at the Poisson layer and captures direct taxon-taxon interactions at the multivariate normal layer using an ℓ1 penalized precision matrix. We show in a synthetic experiment that our method handily outperforms state-of-the-art methods such as SparCC and the graphical lasso (glasso). In a real in planta perturbation experiment of a nine-member bacterial community, we show our model, but not SparCC or glasso, correctly resolves a direct interaction structure among three community members that associates with Arabidopsis thaliana roots. We conclude that our method provides a structured, accurate, and distributionally reasonable way of modeling correlated count-based random variables and capturing direct interactions among them.
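
The two-layer structure described in the abstract can be illustrated with a small simulation. This is a minimal sketch under stated assumptions, not the authors' implementation: it assumes each count is Poisson with log-rate equal to a linear covariate term plus a latent multivariate normal effect, and the function name `simulate_counts`, the covariates, the coefficients, and the three-taxon chain precision matrix are all hypothetical. The sketch also shows why the precision matrix, rather than the covariance, encodes direct interactions: taxa 0 and 2 have a zero precision entry (no direct link) yet remain marginally correlated through taxon 1.

```python
import numpy as np


def simulate_counts(precision, covariates, coefs, rng):
    """Draw counts from an assumed Poisson-multivariate normal hierarchy.

    precision  : (taxa, taxa) positive-definite precision matrix (hypothetical)
    covariates : (samples, predictors) design matrix of confounders
    coefs      : (predictors, taxa) regression coefficients
    """
    n_samples = covariates.shape[0]
    n_taxa = precision.shape[0]
    # Multivariate normal layer: latent effects with covariance = precision^-1.
    covariance = np.linalg.inv(precision)
    latent = rng.multivariate_normal(np.zeros(n_taxa), covariance, size=n_samples)
    # Poisson layer: confounders and latent effects enter on the log scale.
    log_rate = covariates @ coefs + latent
    return rng.poisson(np.exp(log_rate)), latent


# Hypothetical chain of three taxa: 0 - 1 - 2, with no direct 0 - 2 link.
precision = np.array([[2.0, -1.0, 0.0],
                      [-1.0, 2.0, -1.0],
                      [0.0, -1.0, 2.0]])

rng = np.random.default_rng(0)
# Intercept plus one made-up confounder (for example, sequencing batch effect).
covariates = np.column_stack([np.ones(500), rng.normal(size=500)])
coefs = np.array([[1.0, 1.5, 0.5],
                  [0.3, -0.2, 0.1]])

counts, latent = simulate_counts(precision, covariates, coefs, rng)

# The implied covariance is dense even though the precision matrix is sparse.
covariance = np.linalg.inv(precision)
print("precision[0, 2]  =", precision[0, 2])
print("covariance[0, 2] =", covariance[0, 2])
```

A method that thresholds correlations would be expected to link taxa 0 and 2 here, whereas a sparse precision estimate aims to recover only the two direct edges.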


Related articles

Liberation from equations: An equation-free method reveals the ecological interaction networks within complex microbial ecosystems

Mapping the network of ecological interactions is key to understanding the composition, stability, function and dynamics of microbial communities. These ecosystem properties provide the mechanistic basis for understanding and designing microbial treatments that attempt to promote human health and provide environmental services. In recent years various approaches have been used to...


Identifying Keystone Species in the Human Gut Microbiome from Metagenomic Timeseries Using Sparse Linear Regression

Human associated microbial communities exert tremendous influence over human health and disease. With modern metagenomic sequencing methods it is now possible to follow the relative abundance of microbes in a community over time. These microbial communities exhibit rich ecological dynamics and an important goal of microbial ecology is to infer the ecological interactions between species directl...


Correction: Class Prediction and Feature Selection with Linear Optimization for Metagenomic Count Data

The amount of metagenomic data is growing rapidly while the computational methods for metagenome analysis are still in their infancy. It is important to develop novel statistical learning tools for the prediction of associations between bacterial communities and disease phenotypes and for the detection of differentially abundant features. In this study, we presented a novel statistical learning...


mLDM: a new hierarchical Bayesian statistical model for sparse microbial association discovery

Interpretive analysis of metagenomic data depends on an understanding of the underlying associations among microbes from metagenomic samples. Although several statistical tools have been developed for metagenomic association studies, they suffer from compositional bias or fail to take into account environmental factors that directly affect the composition of a given microbial community. In this...


A Poisson-multivariate normal hierarchical model for measuring microbial conditional independence networks from metagenomic count data

(1) Department of Statistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA; (2) Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA; (3) Howard Hughes Medical Institute, University of North Carolina, Chapel Hill, NC 27599, USA; (4) Carolina Center for Genome Sciences, University of North Carolina, Chapel Hill, NC 27599, USA; (5) Departme...



Journal title:
  • Journal of computational biology : a journal of computational molecular cell biology

Volume 23, Issue 6

Pages: -

Publication date: 2015